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Systematic uncertainties in the hght-element abundances and their evolution make a 
rigorous statistical assessment difficult. However, using Bayesian methods we show that the 
P_|. following statement is robust: the predicted and measured abundances are consistent with 
95% credibility only if the baryon-to-photon ratio is between 2 x 10~^° and 6.5 x 10~^° and 
the number of light neutrino species is less than 3.9. Our analysis suggests that the ^He 
^ I abundance may have been systematically underestimated. 



Big-bang nucleosynthesis occurred seconds after the bang and for this reason offers the 
most stringent test of the standard cosmology. Comparison of the predicted and measured 
light-element abundances has evolved dramatically over the past thirty years, beginning with 
the observation that there was evidence for a significant primeval abundance of ^He which 
could be explained by the big bang to the present where the abundances of D, ^He, "^He 
and ^Li are all used to test the big bang. 

The predictions of big-bang nucleosynthesis depend upon the baryon-to-photon ratio 
(= T]) as well as the number of light (< 1 MeV) particle species, often quantified as the 
equivalent number of massless neutrino species (= N,y). For a decade it has been argued 
that the abundances of all four light elements can be accounted for provided rj is between 
2.5 X 10"^° and 6 x 10"^° and < 3.1 - 4 fl^, |18|, The "consistency interval" provides 
the best determination of the baryon density and is key to the case for the existence of 
nonbaryonic dark matter. The limit to provides a crucial hurdle for theories that aspire 
to unify the fundamental forces and particles. 

However, these conclusions were not based upon a rigorous statistical analysis. Because 
the dominant uncertainties in the light-element abundances are systematic such an analysis 
is difficult and previous work focussed on concordance intervals. Given the importance of 
big-bang nucleosynthesis it is worthwhile to try to use more rigorous methods. Here we 
apply two standard methods, goodness of fit and Bayesian likelihood, and identify the the 
conclusions which are insensitive to the systematic errors. 

We begin by reviewing the general situation. The predictions of standard big-bang nu- 
cleosynthesis are shown in Fig. 1. The theoretical uncertainties are statistical, arising from 
imprecise knowledge of the neutron lifetime and certain nuclear cross sections. Because of 10 
Gyr or so of "chemical evolution" since the big bang (nuclear reactions in stars and elsewhere 
which modify the light-element abundances) determining primeval abundances is not simple 
and the dominant uncertainties are systematic. 

The chemical evolution of '^He is straightforward: stars make additional "^He. Stars also 
make metals (elements heavier than ^He); by measuring the ^He abundance in metal-poor, 
extragalactic HII (ionized hydrogen) clouds as a function of some metal indicator (e.g., 
C, N or O) and extrapolating to zero metallicity a primeval abundance has been inferred: 
Yp = 0.232 ± 0.003 (stat) ± 0.005 (sys) pO|. Systematic uncertainties arise from trying to 



convert line strengths to abundances by modeling. The range Yp = 0.221 — 0.243 allows 
for 2a statistical + la systematic uncertainty and is consistent with the big-bang prediction 
provided rj ~ (0.8 — 4) x 10~^° |l^. Others have argued that the systematic uncertainty is 



a factor of two or even three larger |£1|; taking Yp ~ 0.21 — 0.25 increases the concordance 
range significantly, 1] ~ (0.6 — 10) x 10~^°, reflecting the logarithmic dependence of big-bang 
^He production upon r] [ll7| . 

There is a strong case that the ^Li abundance measured in metal-poor, old pop II halo 
stars, ^Li/H = (1.5 ± 0.3) x 10~^°, reflects the big-bang abundance [|22|. However, it is 
possible that even in these stars the ^Li abundance has been reduced by nuclear burning, 
perhaps by a factor of two (the presence of ^Li in some of these stars, which is more fragile, 
provides an upper limit to the amount of astration). Allowing for a 2a statistical uncertainty 
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and for up to a factor of two astration,Q leads to the consistency interval rj = [1 — 6) x 10 

The interpretation of D is particularly challenging because it is burned in virtually all 
astrophysical situations and its abundance has only been accurately measured in the solar 
vicinity. Because D is destroyed and not produced |]23| and because its abundance is so 
sensitive to t] (D/H oc rj'^''^), a firm upper limit to r] can be obtained by insisting that 



big-bang production account for the D observed locally, D/H > (1.6 ±0.1) x 10^ p4|. This 
leads to the two-decade old bound, rj < 9 x 10~^°, which is the linchpin in the argument 
that baryons cannot provide closure density [|25l. Because D is readily destroyed, it is not 
possible to use the D abundance to obtain a lower bound to rj. The sum of D and ^He is 
more promising: D is first burned to ^He, and '^He is much more difficult to burn. On the 
assumption that the ^He survival fraction is greater than 25% the lower limit rj > 2.5 x 10^^'^ 
has been derived [19|. The D, ^He concordance interval, rj ~ (2.5 — 9) x 10"^'^. 

The overlap of the concordance intervals (see Fig. 1), which occurs for t] = (2.5—6) x lO^^'', 
is the basis for concluding that the light-element abundances are consistent with the big-bang 
predictions |]T7 . 



The dominant uncertainties in comparing the predicted and measured light-element abun- 
dances are systematic: the primeval abundance of '^He; the chemical evolution of D and ^He; 
and whether or not ^Li in the oldest stars has been reduced significantly by nuclear burning. 
Systematic error is difficult to treat as it is usually poorly understood and poorly quantified. 
(If it were understood and were well quantified it wouldn't be systematic error!) This is 
especially true for astronomical observations, where the observer has little control over the 
object being observed. 

There are at least three kinds of systematic error. (1) A definitive, but unknown, offset 
between what is measured and what is of interest. (2) A random source of error whose 
distribution is poorly known. (3) An important source of error that is unknown. The first 
kind of systematic error is best treated as an additional parameter in the likelihood function. 
The second kind of systematic error is best treated by use of a distribution, or by several 
candidate distributions. The third type of systematic error is a nightmare. 

The data themselves can clarify matters. Consider ^Li; its measured abundance in old 
pop II stars is equal to the primeval abundance with a small statistical error and a larger 
systematic uncertainty due to astration. This could be a systematic error of the first kind — 
if all stars reduce their ''Li abundance by the same factor — or of the second kind — if the 
^Li abundance in different stars were reduced by different amounts. In the latter case, the 
measured ''Li abundance should show a large dispersion — which it does not [^. Thus, we 
treat astration by considering two limiting possibilities: ^Li/H= (1.5 ± 0.3) x 10^^'^ (no 
astration); and second, ^Li/II= (3.0 ± 0.6) x 10^^° (astration by a factor of two). 

Several sources of systematic error for ^He have been identified — dust absorption, neu- 



^Additional smaller, but important, systematic uncertainties arise from the modeling of stellar atmo- 
spheres, which is needed to convert line strengths to abundances, and possible enhancement of ''Li by 
cosmic-ray production We use astration to illustrate the effects of systematic uncertainty in the ^Li 

abundance, but it serves to illustrate the point for the other effects too. 
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tral ^He, stellar absorption, and theoretical emissivities — which can reduce or increase the 
measured abundance pT] . If the same effect dominates in each measurement use of an offset 
parameter in the ^He abundance would be appropriate. On the other hand, if different effects 
dominate different measurements enlarging the statistical error would be appropriate. We 
allow for both: the statistical error ay is permitted to be larger than 0.003, and an offset in 
the ^He abundance, AF, is a parameter in the likelihood function (Yp = 0.232 + AY). 

Finally, there is the systematic uncertainty associated with the chemical evolution of D 
and ^He. Based upon a recent study of the chemical evolution of D and ^He [26| we consider 
three models that encompass the broadest range of possibilities: Model is the plain, vanilla 
model; Model 1 is characterized by extreme ^He destruction^ (average ^He survival factor 
of about 15%); and Model 2 is characterized by minimal ^He destruction. The likelihood 
functions for these three models are shown in Fig. 2. 

First consider the test for goodness of fit. This technique is best suited to situations 
where the errors are gaussian and well understood and there are many degrees of freedom; 
neither apply here. Nonetheless, in Fig. 3 we show x^iv) for eight different assumptions about 
the systematic uncertainties: (1,5) ay = 0.003, AY = 0; (2,6) ay = 0.01, AY = 0; (3,7) 
ay = 0.003, AY = 0.01; (4,8) ay = 0.01, AY = 0.01. In (l)-(4), ^Li/H = (3 ± 0.6) x 10-^°; 
and in (5)-(8) '^Li/H = (1.5 ± 0.3) x 10~^°. For clarity, only the results for Model are 
shown, the results for Models 1 and 2 are similar. 

Several conclusions can be drawn from Fig. 3. First, the goodness of fit depends sensi- 
tively upon assumptions made about the systematic errors, with the minimum ranging 
from 6 to much less than 1; it is smallest when a systematic shift in ^He is allowed and/or 
ay is increased. Second, in all cases the the rj interval defined by Ax^ = 3 (from the mini- 
mum x^) has a lower bound no lower than about 1.5 x 10~^° (in all but (5), no lower than 
2.5 X 10~^°) and an upper bound no higher than about 6 x 10~^°. 

Next, we turn to Bayesian likelihood, which is best suited to determining parameters of 
a theory or assessing the relative viability of two or more theories. Since there is no well 
developed, alternative to the standard theory of nucleosynthesis at present, likelihood is of no 
use in assessing relative viability. Systematic errors of the first kind are treated as additional 
(nuisance) parameters in the likelihood function which can be determined by the experiment 
itself or can be eliminated by marginalization; we treat AY as such. We also allow ay to vary 
to study how results depend upon the assumed uncertainty in the "^He abundance. Because 
we are interested in setting a limit to N^, it too is taken to be a parameter. Values of N^j 
greater than three describe extensions of the standard model with additional light degrees 
of freedom. 

In Fig. 4 we show contours of C{N^, AY,ay = 0.003). The contours are diagonal lines 
because AY and N^, are not independent parameters — the primary effect of an increase in 

is an increase in the predicted ^He abundance (AYp ~ 0.01 AN^). A likelihood function 
that is not compact must be treated with care, because no information about the parameters 
(here, N^, and AY) can be inferred independently of what was already known (the priors). 

^Because the stars that destroy ^He also make metals, it is not possible to destroy "^He to an arbitrary 
degree without overproducing metals M . 
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|Ar| Model Model 1 Model 2 

3.1/3.1 3.2/3.3 3.1/3.1 

0.005 3.2/3.1 3.3/3.3 3.2/3.1 

0.010 3.3/3.2 3.5/3.3 3.3/3.2 

0.015 3.5/3.4 3.7/3.4 3.4/3.4 

0.020 3.7/3.6 3.9/3.7 3.7/3.6 

Table 1: Limits to A^^ for Models 0, 1, 2 and Li/H = (1.5 ± 0.3) x 10"^° (first number) and 
Li/H = (3.0 ± 0.6) X 10-1° (second number). 

For example, the likelihood function C{Ny), which is needed to set limits to N^, is 
obtained by integrating over Ay and depends upon the limits of integration. To derive 
limits to Ny we do the following: integrate from — |Ay| to |Ay|; normalize C.{Ny) to have 
unit likelihood from N^j = 3 to cxd; the limit is the value of Ny beyond which 5% of the total 
likelihood accumulates. The dependence of the limit upon |Ay| is shown in Table 1. 

An aside; in a recent paper the likelihood function C{N^) obtained by integrating from 
AY = —0.005 to 0.005 was used in an attempt to assess the viability of the standard theory 



of nucleosynthesis ^^l- This likelihood function is peaked at = 2 and is approximately 
gaussian with ctat^ = 0.3. On this basis it was claimed that the standard theory of nu- 
cleosynthesis is ruled out with 99.7% confidence. By so doing equal weight was implicitly 
given to all values of N^, (uniform priors). The prior for N^, = 3 (the standard model of 
particle physics) is certainly orders of magnitude greater than that for = 2 (for which no 
well developed model exists). The likelihood function C{Ny) which properly included prior 
information would certainly not be peaked at N^, = 2.0. 

In Figs. 5 and 6 we show the 95% contours of the likelihood functions C{ri, ay) and 
£(?7, Ay) for Models 0, 1, and 2 and both values of the central ''Li abundance. Both figures 
suggest the same thing: the uncertainty in the primordial ^He abundance has been underes- 
timated. In the <7y — f] plane ay = 0.003 does not intersect the 95% credibility contour, and 
in the AY — rj plane AY = does not intersect the 95% credibility region (except for Model 
1, where they barely do). The 95% credibility contour in the ay — 1] plane becomes inde- 
pendent of ay for ay > 0.008, with 95% credibility interval r/ ~ (3 — 6.5) x 10~^^ (allowing 
both for the uncertainty in the astration of ^Li and in the chemical evolution of D and ^He). 

The 95% credibility contours in the AY—rj plane suggest that the primeval ^He abundance 
has been systematically underestimated, by an amount AY ^ +0.01. (Though it should be 
noted that Model 1 and the lower ^Li abundance are just consistent with AY = at 95% 
credibility.) Put another way, D, ^He, and ''Li are concordant and ^He is the outlayer. (This 
can also be seen in Fig. 2.) When the likelihood function is marginalized with respect to 
Ay, the 95% credibility interval is r/ ~ (2 — 6.5) x 10"^° (allowing again for the uncertainty 
both in astration of ''Li and in the chemical evolution of D and ^He). 

To conclude, the fact that systematic uncertainties dominate precludes crisp statistical 
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statements. The lack of a viable alternative to the standard theory of nucleosynthesis com- 
plicates matters further as the most powerful statistical techniques assess relative viability. 
However, the rigorous techniques that we have applied point to several conclusions that are 
insensitive to assumptions made about systematic uncertainty: 

• The predictions of the standard theory of primordial nucleosynthesis are only consistent 
with the extant observations with 95% credibility provided r/ ~ (2 — 6.5) x 10~^°. 

• Our analysis suggests that the primordial ^He abundance has been systematically un- 
derestimated (Ay ^ +0.01) or that the random errors have been underestimated 
{ay ~ 0.01). Only for Model 1 (extreme destruction of ^He) are AY = and 
ay = 0.003 in the 95% credibility region (cf., Figs. 5 and 6). 

• The limit to N,^ depends upon the systematic uncertainties in the ^He abundance (cf., 
Table 1); taking |Ay| < 0.02, which is four times the estimated systematic error and 
also encompasses the 95% likelihood contour in the AY — t] plane, leads to the 95% 
credible limit Ni, < 3.9. 



This more rigorous analysis provides additional support for the conclusions reached pre- 
viously about the concordance interval for t] [|T^ . The limit to the number of neutrino species 



is less stringent than previously quoted bounds |jT^, |T8| because we allowed a chemical evo- 
lution model with the most extreme destruction of ^He (which permits low values of 7] where 
^He production is lower) as well as a large systematic offset in the ^He abundance. 

Finally, there are two measurements that should reduce the systematic uncertainties sig- 
nificantly, permitting a sharper test of big-bang nucleosynthesis. The first is a determination 
of the primeval D abundance by measuring D-Lya absorption due to high-redshift hydrogen 
clouds. The second is a determination of the primeval ^Li abundance by studying short 
period, tidally locked pop II halo binaries; astration is believed to involve rotation-driven 
mixing astration and is minimized in these stars because they rotate slowly At the 



moment, there are conflicting measurements and upper limits for the primeval D abundance 
seen in high-redshift hydrogen clouds [^], and there is one study which indicates that the 
^Li abundance in short-period binaries is no higher (evidence against significant astration) 
and another that finds weak evidence that the ''Li abundance is higher |3^ . 
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Figure Captions 



Figure 1: The predicted light-element abundances (with 2a theoretical errors); rectangles 
indicate consistency intervals, which all overlap for 77 = (2.5 — 6) x 10~^°. 

Figure 2: Likehhood functions for D and ^He (lower solid curves, from left to right: Models 
1, 0, and 2), ^He (dotted curves, from left to right: cry = 0.01, ay = 0.003, and = 0.01), 
and ^Li (broken = high ^Li, solid = low ^Li). 

Figure 3: Reduced as function of ri for eight different sets of assumptions about the 
systematic uncertainties (see text for details). 

Figure 4: The 5% of maximum likelihood contours for C{Ny^ AF, ay = 0.003) (solid curves 
= low ^Li, broken curves = high ^Li). Because A'^^ and AY are not independent parameters, 
the contours of likelihood are diagonal lines and the likelihood function is not compact. 

Figure 5: The likelihood function C{ay,r], AY = 0) (solid curves = low ^Li, broken curves 
= high ^Li). 

Figure 6: Same as Fig. 5 for C(AY,ri,ay = 0.003). 
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